Short-Text Semantic Similarity (STSS): Techniques, Challenges and Future Perspectives

نویسندگان

چکیده

In natural language processing, short-text semantic similarity (STSS) is a very prominent field. It has significant impact on broad range of applications, such as question–answering systems, information retrieval, entity recognition, text analytics, sentiment classification, and so on. Despite their widespread use, many traditional machine learning techniques are incapable identifying the semantics short text. Traditional methods based ontologies, knowledge graphs, corpus-based methods. The performance these influenced by manually defined rules. Applying measures still difficult, since it poses various challenges. existing literature, most recent advances in research not included. This study presents systematic literature review (SLR) with aim to (i) explain sentence barriers similarity, (ii) identify appropriate standard deep for text, (iii) classify models that produce high-level contextual information, (iv) determine datasets only intended (v) highlight challenges proposed future improvements. To best our knowledge, we have provided an in-depth, comprehensive, trends, which will assist researchers reuse enhance information.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Benchmarking short text semantic similarity

Short Text Semantic Similarity measurement is a new and rapidly growing field of research. “Short texts” are typically sentence length but are not required to be grammatically correct. There is great potential for applying these measures in fields such as Information Retrieval, Dialogue Management and Question Answering. A dataset of 65 sentence pairs, with similarity ratings, produced in 2006 ...

متن کامل

Text-to-Text Semantic Similarity for Automatic Short Answer Grading

In this paper, we explore unsupervised techniques for the task of automatic short answer grading. We compare a number of knowledge-based and corpus-based measures of text similarity, evaluate the effect of domain and size on the corpus-based measures, and also introduce a novel technique to improve the performance of the system by integrating automatic feedback from the student answers. Overall...

متن کامل

Down Syndrome: Current Status, Challenges and Future Perspectives

Down syndrome (DS) is a birth defect with huge medical and social costs, caused by trisomy of whole or part of chromosome 21. It is the most prevalent genetic disease worldwide and the common genetic cause of intellectual disabilities appearing in about 1 in 400-1500 newborns. Although the syndrome had been described thousands of years before, it was named after John Langdon Down who described ...

متن کامل

A Comparative Study of Two Short Text Semantic Similarity Measures

This paper describes a comparative study of STASIS and LSA. These measures of semantic similarity can be applied to short texts for use in Conversational Agents (CAs). CAs are computer programs that interact with humans through natural language dialogue. Business organizations have spent large sums of money in recent years developing them for online customer selfservice, but achievements have b...

متن کامل

ECNUCS: Measuring Short Text Semantic Equivalence Using Multiple Similarity Measurements

This paper reports our submissions to the Semantic Textual Similarity (STS) task in ∗SEM Shared Task 2013. We submitted three Support Vector Regression (SVR) systems in core task, using 6 types of similarity measures, i.e., string similarity, number similarity, knowledge-based similarity, corpus-based similarity, syntactic dependency similarity and machine translation similarity. Our third syst...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2023

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app13063911